A Distribution Free Summarization Method for Affymetrix Genechip Data Preprocessing Analysis

نویسندگان

  • Zhongxue Chen
  • Monnie McGee
  • Qingzhong Liu
  • Richard Scheuermann
چکیده

Motivation: Affymetrix GeneChip brand arrays require a summarization step in order to combine the information in a probe set into one value representing the expression level of the corresponding gene. Here we present a new summarization method, Distribution Free Weighted (DFW) fold change, that uses the information of fold change but does not make any distributional assumptions for the data. Results: Based on spikein data sets, we compare DFW with several popular methods, via both our own calculations and the ‘Affycomp II’ competition. The results show that DFW outperforms other methods when sensitivity and specificity are considered simultaneously. In fact, the area under the Receiver Operating Characteristic (ROC) curve for DFW is nearly 1.0 (a perfect value). Furthermore, DFW can obtain all the true positives with a small number of false positives. It is also computationally faster than most methods in current use. Availability: The R package for DFW is available upon request. Contact: [email protected]

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

2 Preprocessing High - density Oligonucleotide Arrays

High-density oligonucleotide expression arrays are a widely used microarray platform. Affymetrix GeneChip arrays dominate this market. An important distinction between the GeneChip and other technologies is that on GeneChips, multiple short probes are used to measure gene expression levels. This makes preprocessing particularly important when using this platform. This chapter begins by describi...

متن کامل

Model Based Probe Fitting and Selection for SNP Array

Recent advances of high-throughput SNP arrays such as Affymetrix’s GeneChip Human Mapping 500K array set have made it possible to genotype large samples in a fast and cheap manner. A lot of algorithms were developed to call the genotypes from SNP array. When considering the low level preprocessing of SNP array, most algorithms just borrow the techniques from the gene expression microarray. As i...

متن کامل

Systematic order-dependent effect in expression values, variance, detection calls and differential expression in Affymetrix GeneChips®

MOTIVATION Affymetrix GeneChips are common 3' profiling platforms for quantifying gene expression. Using publicly available datasets of expression profiles from human and mouse experiments, we sought to characterize features of GeneChip data to better compare and evaluate analyses for differential expression, regulation and clustering. We uncovered an unexpected order dependence in expression d...

متن کامل

Parameter estimation for the exponential-normal convolution model for background correction of affymetrix GeneChip data.

There are many methods of correcting microarray data for non-biological sources of error. Authors routinely supply software or code so that interested analysts can implement their methods. Even with a thorough reading of associated references, it is not always clear how requisite parts of the method are calculated in the software packages. However, it is important to have an understanding of su...

متن کامل

Affymetrix GeneChip microarray preprocessing for multivariate analyses

Affymetrix GeneChip microarrays are the most widely used high-throughput technology to measure gene expression, and a wide variety of preprocessing methods have been developed to transform probe intensities reported by a microarray scanner into gene expression estimates. There have been numerous comparisons of these preprocessing methods, focusing on the most common analyses-detection of differ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006